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for identification and construction of possible therapeutic compounds in the treatment of various disease conditions. 
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WO 00/14105 PCT/US99/203I9 
Modifications of The VEGF Receptor-2 Protein And Methods Of Use 
TECHNICAL FIELD AND INDUSTRIAL APPLICABILITY OF INVENTION 

5 The present invention discloses the isolation of a key portion of the catalytic 

kinase region of vascular endothelial growth factor receptor 2 or VEGFR-2 through 
cloning, sequencing and x-ray crystallography. Also disclosed is the deletion of 
various amino acid residues from an area of the catalytic region called the kinase 
insert domain (KID). The resulting polypeptide retains comparable in vitro kinase 

10 activity to that of the wild-type KID and is not necessary for the catalytic activity of the 
polypeptide, and more importantly, allows complete crystallization of the protein such 
that it may be characterized by X-ray crystallography. The present invention further 
discloses x-ray crystallography data useful for identification and construction of 
therapeutic compounds in the treatment of various disease conditions associated with 

15 VEGFR-2. 

BACKGROUND OF THE INVENTION 

Many physiological events including embryogenesis, organ development, 
20 estrus, and wound healing require vascular growth and remodeling (Folkman et aL, 
(1992) J. Biol. Chem. 267, 10931-10934; Risau, W. (1995) FASEB J. 9, 926-933.). In 
addition to these beneficial processes, angiogenesis is also involved in the proliferation 
of disease states such as tumor growth, metastasis, psoriasis, rheumatoid arthritis, 
macular degeneration and retinopathy (Pepper, M.S., (1996) Vase. Med. 1, 259-266; 
25 KuiperetaL, (1998) Pharmacol. Res. 37. 1-16, 1998; Kumar and Fidler, (1998) In Vivo 
18, 27-34; Szekanecz et al., (1998) J. Investig. Med. 46. 27-41; Tolentino and Adamis, 
(1988) Int. Ophthalmol. Clin. 38, 77-94. Of the signaling pathways known to influence 
vascular formation, these involving vascular endothelial growth factor (VEGF) haves 
been shown to be essential and selective for vascular endothelial cells (Dvorak et al., 
30 (1995) Am. J. Pat h. 146, 1029-1039; Thomas, K. f (1996) CeN 271, 603-606; Ferrara N. 
and Davis-Smyth, (1997) En docrine Rev. 18, 4-25). The therapeutic potential of 
inhibiting the VEGF pathway has been directly demonstrated by anti-VEGF monoclonal 
antibodies which were active against a variety of human tumors (Borgstrom et al, 
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(1996) Cancer_R.es, 56, 4032-^039) and ischemic retinal disease (Adamis et aL, (1996) 
Ar ch. Ophthalmo l. 114, 66-71). 

Normal vasculogenesis and angiogenesis play important roles in a variety of 
physiological processes such as embryonic development, wound healing, organ 
5 regeneration and female reproductive processes such as follicle development in the 
corpus luteum during ovulation and placental growth after pregnancy (Folkman & 
Shing, 1992). Uncontrolled vasculogenesis and/or angiogenesis has been associated 
with diseases, such as diabetes, as well as malignant solid tumors that rely on 
vascularization for growth. Klagsburn & Soker, (1993) Current Biolog y 3(10):699-702: 
10 Folkham, (1991) J. Natl.. Cancer Inst. 82:4-6: Weidner. et al. f (1991) New Engl. J. 
Med. 324:1-5. 

Several polypeptides with in vitro endothelial cell growth promoting 
activity have been identified. Examples include acidic and basic fibroblastic growth 
15 factor (FGF), vascular endothelial growth factor (VEGF)and placental growth factor. 
Unlike FGF, VEGF has recently been reported to be an endothelial cell specific 
mitogen (Ferrara & Henzel, (1989) Biochem. Biophys. Res. Comm. 161:851-858; 
Vaismanetal., (1990) 1 Biol. Chem. 265:19461-19566). 

20 Thus, identification of the specific receptors to which VEGF binds is 

important to understanding of the regulation of endothelial cell proliferation. Two 
structurally related tyrosine kinases have been identified to bind VEGF with high 
affinity: the flt-1 receptor (Shibuya et aL. (1990) Oncogene 5:519-524; De Vries et aL, 
(1992) S cience 255:989-991) and the KDR/FLK-1 receptor, discussed herein. 

25 Consequently, it had been surmised that RTKs may have a role in the modulation and 
regulation of endothelial cell proliferation. 

Recent disclosures, such as information set forth in U.S. Patent Application 
Ser. Nos. 08/193,829, 08/038,596 and 07/975,750,strongly suggest that VEGF is not 
30 only responsible for endothelial cell proliferation, but also is the prime regulator of 
normal and pathological angiogenesis. See generally, Klagsburn & Soker, (1993) 
Current Biolog y 3:699-702; Houck, et aL, (1992) J. Biol. Chem . 267:26031-26037. 
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VEGF is a homodimeric cytokine that is expressed in at least four splice- 
variant forms of 121-206 residues (Ferrara and Davis-Smyth, 1997). Vascular 
endothelial ceils express at least two high-affinity receptors for VEGF: VEGF-R1/FIM 
and VEGFR-2/KDR. VEGF-R1 and VEGFR-2 are receptor tyrosine kinases each 

5 comprised of an extracellular domain that contains 7 immunoglobulin-like segments 
and binds VEGF, a short membrane spanning region, and a cytosolic domain 
possessing tyrosine kinase activity. The kinase domain directly follows the 
extracellular and juxtamembrane regions and is itself followed by another domain 
(post-kinase domain), which may function in binding of other proteins for signal 

10 transduction. These two receptors appear to have different signaling pathways and 
functions with VEGFR-2 being of primary importance in mitosis of endothelial cells 
(Waltenberger et al., (1994) J. Biol. Chem. 269, 26988-26995; Seetharm et al., 
(1995) Oncogene 70, 135-147; Shalaby et al., (1995) Nature 376, 576-579). 

15 Both FGF and VEGF are potent angiogenic factors which induce formation of 

new capillary blood vessels. Transfection of human breast carcinoma ceil line MCF-7 
with FGF resulted in cell lines that form progressively growing and metastatic tumors 
when injected (s.c.) into nude mice. FGF may play a critical role in progression of 
breast tumors to an estrogen-independent, anti-estrogen resistant metastatic 

20 phenotype (McLeskey et al., (1993) Cancer Re s. 53: 2168-2177). Breast tumor cells 
exhibited increased neovascularization, increased spontaneous metastasis and more 
rapid growth in vivo than did the non-transfected tumors. FGF has been shown to be 
transforming in NIH-3T3 cells and implicated in tumorigenesis and metastasis of 
mouse mammary tumors. FGF overexpression conferred a tumorigenic phenotype on 

25 a human adrenal carcinoma cell line suggesting that FGF's may also play a role in the 
transformation of epithelial cells. Polyclonal neutralizing antibodies to FGF inhibited 
tumor growth in Balb/c nude mice transplanted with K1000 cells (transfected with the 
leader sequence of bFGF) which form tumors in these mice (Hori et al., (1991) 
Cancer Res. 51: 6180-9184). 

30 

Due to the role of FGF in neovascularization, tumorigenesis and metastasis, 
there is a need in the art for FGF inhibitors as potent anti-cancer agents that exert 
their anti-FGF activity by preventing intracellular signaling of FGF. 



3 



WO 00/14105 



PCT/US99/20319 



VEGF, by contrast, is an endothelial cell-specific mitogen and an 
angiogenesis inducer that is released by a variety of tumor cells and expressed in 
human tumor cells in situ. Unlike FGF ? transfection of cell lines with a cDNA 
5 sequence encoding VEGF, did not promote transformation, but did facilitate tumor 
growth in vivo (Ferrara, N., and Davis-Smyth, T. (1997)). Furthermore, administration 
of a polyclonal antibody which neutralized VEGF also inhibited growth of human 
rhabdomyosarcoma, glioblastoma multiforme and leiomyosarcoma cell lines in nude 
mice (Kim et al., (1993) Nature 362: 841-843). 

10 

in view of the importance of receptor tyrosine kinases (RTKs) to the control, 
regulation and modulation of endothelial cell proliferation and potentially 
vasculogenesis and/or angiogenesis, many attempts have been made to identify RTK 
"inhibitors" using a variety of approaches, including the use of mutant ligands (U.S. 

15 Patent No. 4,966,849), soluble receptors and antibodies (Application No. WO 

94/10202; Kendall & Thomas, (1994) Proc. N atl. Acad. Sci . 90:10705-09; Kim, et aL, 
1993), RNA ligands (Jellinek, et al., (1994) Biochemistry 3:10450-56), protein kinase 
C inhibitors (Schuchter, et al., (1991) Cancer Res. 51:682-687); Takano, et al., (1993) 
Mol. Bio. Ce ll 4:358A; Kinsella, et al., (1992) Em_Cell Res ., 199:56-62; Wright, et al., 

20 (1992) J Cellular Phvs. 152:448-57) and tyrosine kinase inhibitors (WO 94/03427; 
WO 92/21660; WO 91/15495; WO 94/14808; U.S. Pat. No. 5,330,992; Mariani, et al., 
(1994) P roc. Am. A ^or. Cancer Res. 35:2268). 

More recently, attempts have been made to identify small molecules which 
25 act as tyrosine kinase inhibitors. For example, bis monocyclic, bicyclic or heterocyclic 
aryl compounds (PCT WO 92/20642), vinylene-azaindole derivatives (PCT WO 
94/14808) and 1-cycloproppyl-4-pyridyl-quinolones (U.S. Pat. No. 5,330,992) have 
been described generally as tyrosine kinase inhibitors. Styryl compounds (U.S. Pat. 
No. 5,217,999), styryl-substituted pyridyl compounds (U.S. Pat. No. 5,302,606), 
30 certain quinazoline derivatives (EP Application No. 0 566 266 Al), selenoindoles and 
selenides (PCT WO 94/03427), tricyclic polyhydroxylic compounds (PCT WO 
92/21660) and benzylphosphonic acid compounds (PCT WO 91/15495) have been 
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described as compounds for use as tyrosine kinase inhibitors for use in the treatment 
of cancer. None of these compounds, however, have been previously associated 
with the enzymatic function of the VEGFR-2 receptor. Likewise, none of these 
compounds have been associated with regulation of vasculogenesis and/or 
5 angiogenesis. 

Therefore, there is a need in the art to develop small molecule antagonists of 
the PDGF, FGF, EGF and VEGF pathways individually or as a group. Moreover, if 
these cytokines signal through a common second messenger pathway within the cell, 

10 such antagonists will have broad therapeutic activity to treat or prevent the 
progression of a broad array of diseases, such as coronary restenosis, tumor- 
associated angiogenesis, atherosclerosis, autoimmune diseases, acute inflammation, 
certain kidney diseases associated with proliferation of glomerular or mesangial cells, 
and ocular diseases associated with retinal vessel proliferation. The present invention 

15 was made by discovering a common signaling mechanism, a group of active 

therapeutic agents, shown to be active by a large number of and variety of predictive 
assays, and discovering a common intracellular signaling intermediate. 

Based on sequence homology and overall domain structure, VEGFRs belong 
20 to the platelet-derived growth factor receptor family (PDGFR) which also includes 
PDGFR , PDGFR , the stem cell growth factor receptor (c-kit), and the colony 
stimulating factor-1 receptor (CSF-1R/c-fms) (van der Geer et ah, (1994) Ann. Rev. 
Cell Biol. 70, 251-337). Compared to other protein kinases, members of this family 
contain an insert of approximately 65-97 residues, termed the kinase insert domain 
25 (KID), within the catalytic kinase domain relative to other protein kinases. Within the 
PDGFR family the KIDs are of varying length and low sequence homology. Deletion 
or mutation of the KID from PDGFR , PDGFR , c-kit, and CSF-1R have indicated that 
this domain is not necessary for intrinsic kinase activity but that it is important for the 
binding of other proteins involved in signal transduction, via autophosphorylation of 
30 KID tyrosine residues (Taylor et al., (1989) EMBO J. 8, 2029-2037; Heidaran et al., 
(1991) MoL Cell. Biol. 77. 134-142; Yu et al.. (1991) Mol. Cell. Bio l 77,3780-3785; 
Kazlauskas et al., (1992) MoL Cell. Bio l. 72, 2534-2544; Lev et al., (1992) Proc. Natl. 
Acad. Sci. USA 89. 678-682; Reedjik et al., (1992) E MBO J . 77, 1365-1372; 
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Bazenet et al., (1996) MaL Ce"- Biol. 16, 6926-6936). Although the signaling 
pathways and the specific role of the KID are still not fully determined for VEGFRs, 
the VEGFR-2 KID does contain two tyrosines which are known to be 
autophosphorylation sites (Dougher-Vermazen et al.. (1994) Biochem. Biophys. Res. 
5 Comm. 205, 728-738). 

Since the determination of the first cyclic AMP-dependent protein kinase 
(cAPK) structure (Knighton et al., (1991) Science 253, 407-413) a variety of protein 
kinase structures have been reported (reviewed in Johnson et al., (1996) CeJJ 85, 
1 0 149-1 58). Among the receptor protein tyrosine kinases (RTKs), structures of the 
kinase domain of the insulin receptor (IRK) (Hubbard, et al., (1994) Nature 372, 746- 
754; Hubbard, (1997) EMBQ J. 16, 5572-5581) and the fibroblast growth factor 
receptor-1 (FGFR1) (Mohammadi et al., (1996) CeJl 86, 577-87; Mohammadi et al., 
(1 997) Science 276, 955-960) have been determined. 

15 

SUMMARY OF THE INVENTION 

The present invention discloses the generation, kinetic characterization, and 
20 structure determination of a modified kinase domain of the VEGFR-2 protein, 
containing 18 residues of the 68 residue KID. This 2.4 A crystal structure of the 
phosphorylated VEGFR-2 catalytic domain is the first reported structure of a kinase 
domain of the PDGFR family. This structure provides insights into the orientation of 
the KID domain of VEGFR-2 which may be relevant to other PDGFR family members. 
25 Furthermore, as inhibition of VEGFR-2 kinase has broad clinical applications, this 
structure provides a three-dimensional description of the target for structure-based 
design of small molecule VEGFR-2 inhibitors as therapeutic agents. 

It is an object of the present invention to disclose an effective method for 
30 screening candidate compounds that are specifically agonists or antagonists of 

various proteins which can be included in the receptor tyrosine kinase family (RTK) by 
crystallizing RTKs and particularly the VEGFR-2 receptor in order to use molecular 
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modeling of the x-ray crystallography data to model the binding of candidate 
compounds. 

There is disclosed a method for designing and screening potentially 

5 therapeutic compounds with activities such as: (1) inhibiting new blood vessel 

formation that is useful for treating or preventing progression of diabetic retinopathy, 
cavernous hemangiomas, Kaposi's sarcoma, tumors composed of endotheliai-iike 
cells, and growth of cancer cells by preventing their development of a new blood 
supply: (2) suppressing development of kidney diseases due to cytokine induced 

10 proliferation of mesangial cells and/or glomerular epithelial cells that is useful for 
treating or preventing progression of diabetic glomerulosclerosis and other 
glomerulonephritis of various types and etiologies; (3) preventing joint destruction 
accompanying rheumatoid arthritis due to proliferation of synovial cells; (4) 
suppressing manifestations of psoriasis due to proliferation of keratinocytes and 

15 accumulation of inflammatory cells; (5) suppressing accelerated atherogenesis 
involved in restenosis of coronary vessels or other arterial vessels following 
angioplasty; (6) suppressing atherogenesis, coronary artery disease and other 
vasculopathies due to atherogenesis; and (7) suppressing tumor growth via paracrine 
or autocrine mediated responses to other cytokines such as PDGF, FGF EGF or 

20 VEGF that is useful for treating or preventing progression of tumors such as breast 
cancer stimulated through overexpression of her-2-neu receptor, wherein the 
inventive method comprises administering a compound that inhibits signal 
transduction. 

25 The present invention is useful in developing methods that are used in the 

iterative drug design process. The process identifies potential agonists and 
antagonists to VEGFR-2 by de novo design of novel drug candidate molecules which 
bind to the VEGFR-2 receptor to improve their potency. The x-ray crystaliographic 
coordinates disclosed herein, will allow generation of 3-dimensional models of the 

30 catalytic site and drug binding site of the VEGFR-2 protein. 

De novo design primarily consists of the generation of molecules via the use 
of computer programs which build and link fragments or atoms into a site based upon 
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* steric and electrostatic complementarity, without reference to substrate analog 
structures. The drug design process begins after the structure of a target RTK is 
solved to at least a resolution of 2.8 A. Refinement of the structure to a resolution of 
2.5 A or better, with "fixed" water molecules in place provides more optimal conditions 
5 to undertake drug design. 

It is another object of this invention to identify KIDs of proteins in the RTK 
family and develop deletions in said KIDs such that the proteins will be crystallizable 
and suitable for measurement by x-ray crystallographic means. 

10 

It is a further object of this invention to disclose a process whereby KID 
regions from a member of the RTK family of genes such as PDGF, EGF, VEGF and 
others are modified by deletion of amino acids from the KiD regions so as to impart 
favorable physical characteristics of the resulting polypeptide product. Examples of 
15 such favorable physical characteristics are increased solubility, greater stability to 
temperature variations making the polypeptide suitable for analysis by nuclear 
magnetic resonance, high throughput screening, biochemical characterizations, x-ray 
crystallography, calorimetry and other diagnostic means. 

20 It is yet another object of this invention to developing screening methods used 

in the drug design process of potential agonists and antagonists to proteins in the 
RTK family by de novo design of novel drug candidate molecules with potentially 
nanomolar potencies. The x-ray crystallographic coordinates disclosed based on the 
deletion mutated KIDs and various other deletions of said proteins in the RTK family, 

25 will allow generation of 3-dimensional models of the active binding site of the proteins 
in the RTK family. 

BRIEF DESCRIPTION OF THE DRAWINGS 

30 Figure 1 . Secondary structure assignments (as given by Procheck) for the catalytic 
domain of VEGFR2 and sequence alignment with other representative receptor 
tyrosine kinases, helices are designated as B- 1, p strands are designated as 1-8. 
The site of 50 residue deletion in VEGFR2 50 is indicated by |. The site of the E990V 
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mutation in VEGFR2D50 is denoted by an *. Sequences are from: VEGFR2 
(reported here); FGFR1 (Swiss protein database #P11362); IRK (EMBL protein 
database #A18657; numbering as in Mohammadi et a!., 1996); VEGFR1 (Swiss 
protein database #P17948); PDGFR (Swiss protein database #P17948). 

5 

Figure 2. Overall fold of VEGFR2 50P, FGFR1, and IRKP. 
Backbone representation of structures of the kinase domains of (A) VEGFR2 
(VEGFR2 50P), (B) FGFR1 (molecule A of PDB entry 1FGK, Mohammadi et al., 
1996), and (C) IRKP (PDB entry 1IR3, Hubbard et al., 1997). The views shown in A, 
10 B, and C are identical views generated from superpositions of the C-terminal 
domains. The positions of the termini are denoted by N and C. The nucieotide- 
binding loop (orange), kinase insert domain (pink), and activation loop (yellow) are 
highlighted. In (C) the bound AMP-PNP is shown in green and the peptide substrate 
is shown in red. Figure prepared with INSIGHT II. 



15 



20 



25 



30 



Figure 3. Catalytic site of VEGFR2 50P and IRKP. 

Cross section of the catalytic site of (A) VEGFR2 50P and (B) IRKP (PDB entry 1 1R3; 
Hubbard et al., 1997) structures. Atoms are colored by element type: carbon (green), 
oxygen (red), nitrogen (blue), sulfur (yellow), phosphorous (pink).and magnesium ion 
(orange). (A) includes only protein atoms. (B) includes protein atoms, AMP-PNP 
atoms, and Mg2+ ions. Figure generated using INSIGHT II. 

Figure 4. Nucleotide binding site of VEGFR2 50P and FGFR1 . 
Stereo view showing C trace and some sidechains of a superposition of the nucleotide 
binding sites of the VEGFR2 SOP and the FGFRI-(AMP-PCP) complex (molecule B, 
Mohammadi et al.. 1996) structures. The superposition was done using C positions of 
helices (D.E.F.G.H, and I) of the C-terminal lobes. Carbon atoms of VEGFR2 50P are 
shown in yellow and carbon atoms of FGFR1 are shown in purple. The coloring for 
other protein atoms is: oxygen (red), nitrogen (blue), and sulfur (green). The AMP- 
PCP in the FGFR1 structure is depicted in orange. Labels correspond to VEGFR2 50P 
residues. Figure created with Xfit (McRee et al., (1992) J. Mol. Grap h w, 44-46.). 

Figure 5. Electron density map of the kinase insert domain area of VEGFR2 50P. 
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Stereo view of a 2F 0 -F C map computed at 2.4A and contoured at 1 .2 and 
superimposed with the refined model. Carbon atoms are yellow, oxygen atoms red, 
and nitrogen atoms are blue. Water molecules are depicted as red crosses. Figure 
created with Xfit (McRee et al., 1992). 

5 

Figure 6. Kinase insert domain of VEGFR2 SOP. Stereo cross section showing the 
ordered residues of the kinase insert domain of VEGFR2 SOP. Carbon atoms are 
yellow, oxygen atoms are red, nitrogen atoms are blue, and sulfur atoms are green. 
View is rotated roughly 180° from Figure 5. Figure created with Xfit (McRee et al., 
10 1992). 

Figure 7. Resulting X-ray crystallography coordinates for VEGFR-2 based on the 
method disclosed in the crystallization and data collection section. 

15 DETAILED DESCRIPTION AND PREFERRED EMBODIMENTS OF THE 

INVENTION 

Cloning of The VEGFR-2 Protein 

20 The coding sequence (Terman et al., (1992) Biochem Biophvs. Res. Commun 

787, 1579-86) for the cytoplasmic domain of the VEGFR-2 was amplified by PCR 
(Mullis et al., (1986). Biotechnology 24, 17-27) from a human aorta cDNA pool 
(Clontech Palo Alto, CA ). Two overlapping sequences were amplified independently. 
Vcyt (residues M806-V1356), which represented the entire cytoplasmic domain, and 

25 Vcat (residues C817M-G1 191 ), with boundaries based upon a primary amino acid 
sequence alignment with the insulin receptor kinase catalytic domain (Wei et al., 
(1995) J. Biol. Chem. 270, 8122-8130). 

The PCR oligonucleotide primer sequences for Vcyt were: 
30 Vcyt5 5'-CAGCATATGGATCCAGATGAACTCCCATTGG-3' (Seq. ID No. 1) and 
Vcyt3 5-GCGGTCGACTTAAACAGGAGGAGAGCTCAGTGTG-3' (Seq. ID No. 2). 

The PCR oligonucleotide primer sequences for the Vcat were: 
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VcatS 5 , -GCACATATGGAACGACTGCCTTATGATGCCAGC-3 , (Seq. ID No. 3) and 
Vcat3 5-CCTGTCGACTTATCCAGAATCCTCTTCCATGCTCAAAG-5' (Seq. !D No. 4). 

The amplified DNA was digested with the restriction enzymes Ndei and Sail, 
5 ligated into the £ coli plasmid pET24a (Novagen Madison, Wl) and sequence verified. 
When compared to the original VEGFR-2 sequence in Genbank, (Accession number 
346345) two nucleotide differences were noted that resulted in codon changes 
(Glu848-Val and Asn835-Lys) in both Vcyt and Vcat. Our sequence agrees with 
subsequent VEGFR-2 Genbank submissions (Accession numbers 2655412 and 
10 3132833). 

Mutations were introduced by oligonucleotide site directed mutagenesis 
(Kunkel, 1985) using the Muta-Gene in vitro Mutagenesis Kit from (Bio-Rad Hercules, 
CA). The Vcat DNA fragment was subcloned from the pET24a vector using an Ndel- 

15 Xhol digest into the vector pMGH4 (Schoner et al., 1986, Kan et a!., 1992) and this 
vector was used to generate the ssDNA uracil template (minus strand) in E. coli strain 
CJ236 supplied in the kit. An oligo (S'-CTCAGCAGGATTGATAAGACTACATTGTTC- 
3') was designed to create a construct (Vcat( G1172-G1191)) which truncated the C- 
terminus to residue D1 171 . Another oligo (5'- 

20 GAATTTGTCCCCTACAAGGAAGCTCCTGAAGATCTG-3') was designed to delete 
the central 50 residues (residues T940-E989) of the insert kinase domain, based on a 
sequence alignment with FGFR1 (Mohammadi et al, 1996). Sequence analysis 
detected an inadvertent Glu990-Val mutation. All DNA modification and restriction 
enzymes were purchased from New England Biolabs and oligonucleotides were 

25 purchased form Genosys Biotechnology. 

The VEGFR2 50 construct was made in several steps to combine the necessary 
mutations into the baculovirus expression vector pAcSG2 (Pharmingen San Diego, CA). Step 
1; the coding region for Vcyt was PCR subcloned from the pET24a vector into the Ncol-Kpnl 
30 sites of vector pAcSG2. Step2; a 2358bp Scal-Bglll DNA fragment from plasmid pMGH4-Vcat 
( T940-E989.E990V) was ligated to a 1695bp Bglll-Scal DNA fragment from pMGH4-Vcat 
( G1172-G1191) creating a pMGH4-Vcat ( T940-E989,E990V, G1172-G1 191) vector. Step 3; a 
913bp BstEII-Eagl DNA fragment a pMGH4-Vcat ( T940-E989,E990V, G1172-G1 191) was 
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Sigated to a 3290bp Eag!-BstE!i DNA fragment from pAcSG2-Vcyt creating pAcSG2-Vcyt 
( T940-E989.E990V, G1 172-G1 191), also referred to as VEGFR2 50. This final construct was 
sequenced verified through the entire coding region and confirmed to contain only these known 
mutations from the wild-type sequence (sequence shown in Figure 1). 

5 

DNA encoding VEGFR2 50 was transfected into Sf9 cells with linearized 
baculovirus DNA according to the protocol of the manufacturer (Pharmingen San 
Diego, CA). Single plaques were isolated from this transfection and high titer stocks 
generated. All stocks were examined by isolation of baculoviral DNA and PCR 
10 amplification of the insert using the polyhedron forward and reverse primers 

(Invitrogen). Sf21 cells were infected at 1-1.5 million cells/mL at MOI=5 for 72 hours 
and harvested by centrifugation. 

Purification Of VEGFR2 50 From Sf21 Cells 

15 

Cell pellets were lysed by dounce homogenization and sonication in 20 mM 
Tris pH 8.0, 20 mM NaCI, 5 mM DTT, and 5% (v/v) glycerol. The lysate was 
centrifuged for 50 minutes at 35,000 rpm in a Ti45 rotor. The soluble fraction was 
loaded onto a 40 ml Q-30 anion exchange column (Pharmacia) and eluted with a 20 

20 mM to 600 mM NaCI gradient in 20 mM Tris pH 8.0, 5 mM DTT, and 5% (v/v) glycerol 
over 20 column volumes. VEGFR2 50 protein was pooled by SDS-PAGE gel analysis 
and by the presence of kinase activity as measured against gastrin substrate peptide 
substrate (Boehringer Mannheim). Pooled material was loaded onto a 40 mL 
hydroxyapatite (Bio-Rad) column and washed extensively with 20 mM Tris pH 8.0, 50 

25 mM NaCI, 5 mM DTT, and 5% glycerol. Protein was eluted using a 500 mL linear 

gradient from 0 to 50 mM potassium phosphate pH 8.0, 50 mM NaCI, 5 mM DTT, and 
5% glycerol. VEGFR2 50 protein was pooled by SDS-PAGE gel analysis and by the 
presence of kinase activity as measured against the gastrin peptide. Material from 
this column was then diluted 1:1 with 20 mM Tris pH 8.0, 20 mM NaCI, 5 mM DTT, 

30 and 5% glycerol and loaded onto an 8 mL Q-15 anion exchange column (Pharmacia). 
Protein was eluted using with a 180 mL linear NaCI gradient (20 mM-175 mM) in 20 
mM Tris pH 8.0, 5 mM DTT, and 5% glycerol. VEGFR2 50 protein was pooled as 
described above. 4M (NH4)2S04 was added to the pool to final concentration of 0.6 
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M and the pool loaded onto a'.OmL HP-phenyi sepharose column (Pharmacia). 
VEGFR2 50 protein was eiuted using a 200 mL linear reverse gradient from 0.6 M to 0 
M (NH4)2S04 in 20 mM Tris and 5 mM DTT. Purified VEGFR2 50 protein was buffer 
exchanged into 50 mM Hepes pH 7.5, 10 mM DTT, 10 % glycerol, and 25 mM NaCI 
5 over a 500 mi G-25 column (Pharmacia) and concentrated to 1 mg protein/mL 

through a 10 kD cutoff polysulfone membrane (Amicon). Final material was aiiquoted 
and flash frozen in liquid N2 and stored at -70'C. 

Kinetic Assay s 

10 

The coupled spectrophotometric assays were done with purified VEGFR2 50 
protein that was autophosphorylated under conditions: protein (4 mM), ATP (3 mM), 
MgCl2 (40 mM), DTT (5 mM), in Hepes (100 mM), 10% glycerol, pH 7.5 at 4 °C for 1 
hour. 

15 

Coupled Spectrophotometric Assay for the Forward Direction 

Tyrosine kinase assays were monitored using a Beckman DU 650 
Spectrophotometer. Production of ADP was coupled to oxidation of NADH using 
20 phosphoenolpyruvate (PEP) through the actions of pyruvate kinase (PK) and lactic 
dehydrogenase (LDH). The oxidation of NADH was monitored by following the 

-1 -1 

decrease in absorbance at 340 nm (e34o=6.22 cm mM ). Typical reaction 
solutions contained: 1 mM PEP, 250 mM NADH, 50 units of LDH/mL, 20 units of 
PK/mL, 5 mM DTT, in 200 mM Hepes, pH 7.5 and varying concentrations of 
25 poly(E4Yi) (Sigma), ATP and MgCl2- Assays were initiated with 40 nM of 
VEGFR2 50 protein. 

Coupled Spectrophotometric Assay for the Reverse Reaction 

30 ATP generation was coupled to production of NADH via the action of 

hexokinase (HK) and glucose-6-phosphate dehydrogenase (G6PD). In this assay, 
HK catalyzes the conversion of ATP to ADP and glucose-6-phosphate. Glucose-6- 
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phosphate is then oxidized to D-6-phosphogluconapyranose-l f 5-Iactone by G6PD 
with concomitant reduction of NAD to NADH which can be monitored at 340 nm, 
Typical assay solution contained: glucose (10 mM), NAD (40 mM), DTT (5 mM), 
MgCl2 (4 mM), HK (15 unit/mL), G6PD (15 units/mL) and indicated concentrations of 
5 ADP and phospho-poly(E4Y). The reactions were initiated with addition of 
VEGFR2 50 protein (600-900 nM). 

Evaluation of Potential Agonists and Antagonists of the VEGFR2 50 Protein 

10 Based on the above spectrophotometric and kinetic assays, one can evaluate 

potential candidate agonists or antagonists of the VEGFR2 50 protein by addition of 
the candidate compounds to the above assay in a competition. As stated above, the 
kinetics of the activity of the VEGFR2 50 protein were measured against the gastrin 
peptide. The activity in the presence and absence of a candidate compound is 

15 measured and the resulting kinetic data is compared. The affinity of the candidate 
for the receptor will be reflected in the shift to the right of the kinetic curves indicating 
a competitive antagonist or with a decrease in the maximum activity, which would 
indicate a non-competitive antagonism. Conversely, a shift to the left of the kinetic 
curves would indicate a competitive agonist to the VEGFR2 50 protein. See generally, 

20 Bourne, H.R., et al. in, (1987) Basic & Clinical Pharmacology (Katzung, et al., eds) 
(Ch. 3) 9-22. 

In Vitro Autophosphorvlation Of VEGFR2 50 For Crystallization And Mass 
Spectrometry , 

25 

Aliquots of frozen VEGFR2 50 protein were thawed by immersion in cold H2O 
and pooled at 4'C. MgCl2 and ATP were added to 26 mM and 4 mM, respectively. 
VEGFR2 50 was incubated at 4'C for 1 hour. This material (VEGFR2 SOP) was then 
buffer exchanged into a solution of 10 mM Hepes 7.5, 10 mM DTT, and 10 mM NaCI 
30 and concentrated using a Centriprep-10 (Amicon) to 5 mg protein/mL 

Mass spectrometry 
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Trypsin digestion:. Trypsin digestions of purified VEGFR2 50 and VEGFR2 SOP 
were conducted at 37°C suing 0.37 mg/ML protein in 25 mM NH4HCO3 at pH. 7.7 
with a reaction volume of 100 L for two days. 

5 MALDI/MS. MALDI-MS analyses were performed in a Voyager-Elite, tirne-of- 

flight mass spectrometer with delayed extraction (PerSeptive Biosystems, Inc., 
Framingham, MA). A volume of 1 L of digested protein sample was mixed with 1 L of 
matrix (a-cyano-4-hydroxy-cinnamic acid) in a solution of 50% (v/v) solution of 
acetonitrile and 0.25% (w/w) trifluoroacetic acid in water. Samples were irradiated 
10 with a nitrogen laser operated at 337 nm. 

NanoESI-MS. NanoESI-MS analyses were performed on a triple quadrapole 
mass spectrometer (PE Sciex API III, Alberta, Canada) modified with a NanoESI 
source from Protana A/S, (Denmark). The ESI voltage was set at 700 V and the 
15 orifice settings were maintained at 100 V. 3 L of digested protein was mixed with 7 L 
of methanol and 0.5 L formic acid and then 4 L of this sample was injected into the 
mass spectrometer. Ion scans were used to obtain the sequence of phospho- 
peptides. 

20 Crystallization and Data Collection 

Purified phosphorylated VEGFR2 50 was concentrated on average to 5 mg 
protein/mL using a Centricon-10 centrifugal concentrator. Crystals were grown by the 
hanging drop vapor diffusion method at 4°C. Drops containing 2 L of protein solution 
25 and 2 L of a mother liquor solution (100 mM Hepes at pH 7.2, 2 M (NH4)2S04 } , and 
2% (v/v) monomethylether polyethylene glycol mW=550) were equilibrated above a 1 
mL reservoir of the mother liquor solution to which 50 mM -mercaptoethanol had 
been added. Crystals appeared after 3-4 days and grew to as large as 0.3 x 0.2 x 0.5 
mm over 21 days. 

30 

X-ray diffraction data sets were collected using a Rigaku RU-200 rotating 
anode X-ray generator (CuK ) operated at 50 kV and 100 mA and equipped with 
Supper focusing mirrors and a MAR345 MAR Research image plate detector. Data 
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collection on frozen crystals was done by transferring a crystal into a cryoprctectant 
solution {100 mM Hepes at pH 7.2, 2.2 M (NH4)2S04 f 0.6 M sucrose, 0.55 M 
glucose, and 2% (v/v) rnonomethylether polyethylene glycol MW=550), flash freezing 
the crystal in liquid nitrogen, and then transferring the frozen crystal into a stream of 
5 nitrogen at -186 °C. Data was integrated and scaled using DENZO and SCALEPACK 
(Otwinowski, 1993) Data collection statistics are given in Table 2. 

Initial protein phases were obtained using the AMoRe molecular replacement 
program (Navaza, 1994), molecule 1 of the FGFR1 structure (Mohammadi et ah, 
1996; PDB entry 1FGK) as a search probe, and the nativel data set. The correct 
solution was achieved by including the FGFR1 sidechains and removing from mobile 
residues of the activation loop (640-660), the N-terminus (464-467), a short loop 
(517-520), and the C-terminus (760-762) from the search model. The correct solution 
was the top peak in the rotation and translation functions with a correlation coefficient 
of 0.31. Rigid body refinement in AMoRe improved the solution to a correlation 
coefficient of 0.49 and an R-factor of 46.3% in the 12.0 - 4.0 A resolution range. The 
correctness of this solution was cross-checked by calculation of a difference Fourier 
with a KAu(CN)2 derivative. This derivative was generated by soaking a crystal for 3 
days in reservoir solution containing 0.5 mM KAu(CN)2 and then increasing the heavy 
atom concentration to 5 mM and soaking for an additional 64 hours. Scaling of data 
sets, Patterson calculations, Fourier calculations, and the generation of phases were 
done using Xtalview (McRee et aL 1992) 

Refinement of the model was done using Xplor version 3.1 (Brunger, 1992). 
25 Calculation of electron density maps and model fitting was done using XtalView 
(McRee et al., 1992) Refinement was begun using a data set collected at 4 °C 
(native2) and was completed using a data set (native3) collected at -186 °C. The 
final R-factor is 20.2% for data in the range 8-2.4 A (Fo > 25 ). The average B value 

for all atoms is 31 .8 A 2 for protein atoms and 4^.3 A 2 for water molecules. The final 
30 model includes residues 820-939, 998-1047, and 1064-1 168; of these residues the 
sidechains of K838, R842, F845, K939, D998, K1023, R1027, Y1038, K1039, K1110, 
and E1113 could not be modeled beyond C due to a lack of interpretable density. 
Analysis of main-chain torsion angles as done using PROCHECK (Laskowski et al., 
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1993) shows of the 275 residues in the model none occur in the disallowed region 
and only 4 occur in the generously allowed region of a Ramachandran plot. 182 water 
molecules were fit to electron density peaks which were greater than 38 and were 
located in positions to make reasonable hydrogen bonds to the protein or other water 
5 molecules. 

Superpositions of various kinase structures was done using the graphics 
program Insight II (Molecular Simulations Inc. San Diego, CA). 

10 Example 1- Structure Determinations 

The tyrosine kinase domain of human VEGFR-2 lacking the 50 central 
residues of the 68 residues of the KID was expressed in a baculovirus/insect cell 
system. Of the 1356 residues of full-length VEGFR-2 this construct (VEGFR2 50) 
1 5 contains residues 806-939 and 990-1 1 71 of the cytosolic domain (Figure 1 ). 

VEGFR2 50 also contains one point mutation (E990V) within the KID relative to wild- 
type VEGFR-2. 

In addition to catalyzing its autophosphorylation, VEGFR2 50 is also capable of 
20 catalyzing phosphorylation of a poly(E4Y) exogenous substrate. Detailed kinetic 

analysis (Table 1) revealed that its kinetic parameters were nearly identical to that of 
a comparable VEGFR-2 protein construct containing the entire KID (Parast et al., in 
press). These results taken together indicate that VEGFR2 50 is a fully active 
functional enzyme. Therefore, deletion of 50 central residues of the KID has no 
25 observed effect on the catalytic steps of the phosphotransfer reaction. It was also 
determined that deletion of more than 60 amino acids from the KID region did cause 
a diminishment in the activity of the enzyme. 
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5 Forward Reaction 



Substrate 


Km (mM) 


/feat (s~ 1 ) 


/feat/ K M (s-V 1 ) 


MgATP 


0.153 


13.3 


87 x 10 3 


10 poly(E4Y) 


2.1 




63 x 10 2 


Mg 2+ 


6.8 




20 x 10 2 



15 



Reverse Reaction 



Substrate 


Km (mM) 


/feat (s" 1 ) 


Acat/K/U (s~V 1 ) 


MgADP 
P-poly(E 4 Y) 


0.056 
1.0 


0.13 


23 x 10 2 
13x10 1 



25 

The VEGFR-2 KID sequence is hydrophilic and highly charged, containing 6 
lysine, 5 arginine, 8 glutamic acid, and 5 aspartic acid residues (Figure 1). Initially 
several protein constructs containing the VEGFR-2 catalytic domain with the entire 

30 KID were generated. After exhaustive attempts to crystallize these protein constructs 
failed to yield even marginal crystals, the VEGFR2 50 construct was created to test 
the idea that the highly charged KID was interfering with crystallization. As 
determined by dynamic light scattering this VEGFR2 50 construct, which eliminated 14 
charged residues, exhibited markedly better stability to temperature and protein 

35 concentration than protein constructs containing the entire KID. 

For crystallization, purified VEGFR2 50 was autophosphoryiated in vitro by 
incubation with MgATP. Matrix-assisted laser desorption ionization (MALDI) and 
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nanoelectrospray ionization (NanoESI) mass spectrometry analysis of full-length 
phosphorylated VEGFR2 50 (VEGFR2 SOP) protein and tryptically digested peptides 
indicates phosphorylation of Y1059 using the autophosphorylation conditions 
described here. Crystals diffracting to 2.2 A were obtained of VEGFR2 SOP in an 
5 unligated state. The crystals belong to the orthorhombic space group P2-|2i2i with 
one VEGFR2 50P molecule in the asymmetric unit. Initial crystaliographic phases 
were determined by molecular replacement using the structure of the 
unphosphorylated kinase domain of FGFR1 (Mohammadi et al., 1996) as a search 
model. The correctness of the molecular replacement solution was cross-checked 

10 using a gold cyanide derivative. The derivative data, however, was not used for 
phase calculations of electron density maps used to build the model. The structure 
has been refined to an R-factor of 20.2% for 8-2.4A data (Fo > 25). 
VEGFR2 50P residues for which backbone atoms were not modeled due to disorder 
include the N-terminal residues 806-819, residues 990-997 of the KID, residues 1048- 

15 1063 of the activation loop, and residues 1 169-1 171 of the C-terminus. Structure 
determination statistics are included in Table 2. 

Table 2: VEGFR2 50P structure determination statistics 

20 



Data Set 


Native (3) 


Native (1) 


Native (2) 


KAu(CN)2 


Data resolution (A) 


15-2.2 


20-3.0 


15-2.4 


15.3.1 


ftsym (%) 


5.2 a (19.6) b 


8.4 (19.2) 


7.0 (21.9) 


7.1 (19.5) 


Completeness (%) 


93.0 (81.0) 


97.5 (98.4) 


98.8 (98.8) 


96.5 (95.0) 


Temperature (°C) 


-186 


room (-21 ) 


4 


4 


Unit cell a (A) 


95.41 


97.10 


98.52 


97.71 


Unit cell b (A) 


96.04 


96.94 


96.50 


96.97 


Unit cell c (A) 


38.22 


38.63 


38.56 


38.52 


Refinement 


8-2.4 








resolution (A) 










Refined R (%) 


20.2 c,d 









3 Rsym = hkl i|/i (hkl)-</(hkl)>| / hkl \l\ (hkl) 
25 b Value in parenthesis is for highest (resolution shell) 
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° R = hkii \Fom)\-\F c (hkf)| | / hkl \F 0 (hkl)j 

where Fo and F c are the observed and calculated structure factors, 
respectively (F 0 > 2 ) 

5 

d 

Model includes 275 protein residues and 182 water molecules 
Overall Kinase Fold 

10 

Analogous to previously reported structures of both serine/threonine and 
tyrosine protein kinases, VEGFR2 SOP is folded into two lobes with catalysis of 
phosphotransfer taking place in a cleft between the two lobes (reviewed in Cox et al., 
1994; Johnson et al., 1996) A C trace of the VEGFR2 50P structure is shown in 

15 Figure 2a. Kinase secondary structural elements are designated (Figure 1) according 
to the convention originally given for cAPK (Knighton et al., 1991 ). The N-terminal 
lobe (approximately residues 820-920) folds into a twisted sheet with one helix ( C). 
The structure comprises five antiparallel strands ( 1-5), three of which (1-3) are highly 
curved and curl over the other two strands ( 4- 5). The larger C-terminal domain 

20 (approximately residues 921-313) contains two antiparallel strands ( 7- 8), which lie at 
the top of the C-terminal domain adjacent to the N-terminal -sheet. Seven -helices 
( D, E, E-F, G, H, I) form the remaining core of the C-terminal domain. Like other 
kinases, VEGFR2 SOP contains two functionally important loop regions: the glycine- 
rich nucleotide binding loop (residues 841-846), the catalytic loop (residues 1026- 

25 1033) and the activation loop (residues 1046-1075) (Figures 1 and 2a). 

Of the reported kinase structures, the VEGFR2 SOP structure resembles most 
closely that of the catalytic domain of FGFR1 (Mohammadi et al., 1996; PDB entry 
1FGK) with which it shares approximately 55% sequence identity (Figure 1). Since 
30 the two molecules in the crystallographic asymmetric unit of the FGFR1 structure 

solution are very similar, comparisons to VEGFR2 50P will primarily be described only 
for FGFR1 molecule A. Least squares superposition of 82 C positions of ( 1- 5) of the 
N-terminal lobe or 152 C positions residues ( D, E, F, G, H, I ) of the C-terminal lobe 
between FGFR1 and VEGFR2 SOP result in respective rms deviations of 0.40 A and 
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0.52 A. A relative rotation of approximately 5° between the two lobes results in the 
interlobe cleft of VEGFR2 50P being slightly larger and more open. Measurement of 
distances between equivalent C 's (K523 and R675 of FGFR1 , S877 and R1080 of 
VEGFR2 SOP) at the ends of the cleft reveal that this distance is 25.3 A in 
5 VEGFR2 SOP as compared to 23.2 A in FGFR1. This is however a minor difference, 
as compared to much larger relative lobe rotations observed among kinase structures 
in various ligation and phosphorylation states (Johnson et al., (1996) CelJ 85, 149- 
158). For example, the inter-lobe orientation seen here for VEGFR2 SOP is in an 
approximately 20° more open conformation than that seen in the ternary complex 
10 structure of the phosphorylated kinase domain of IRK bound to the ATP analog AMP- 
PNP and a peptide substrate (Hubbard, (1997) EMBO J. 16, 5572-5581; PDB entry 
1IR3) (Figure2c). 

While the -strand positions of the N-terminal lobe agree well between 
15 VEGFR2 50P and FGFR1, the structures do diverge significantly at the N-terminal 
residues preceding the first conserved region starting at residue W827 (Figure 2a and 
2b). The first 14 residues (M806-E819) of VEGFR2 SOP are completely disordered 
and the next seven residues (L820-R826) form an extended loop structure. It is likely 
that residues 806-819 do not form part of the active kinase region but are instead part 
20 of, or are adjacent to, the juxtamembrane region of VEGFR-2. Residues 820-826 do 
seem to be part of the kinase domain, although a flexible one, as analogous residues 
are also ordered in the structures of FGFR1 , IRK, and the non-receptor tyrosine 
kinase Lck (Yamaguchi and Hendrickson, (1996) Nature 384, 484-489). Other 
differences between the VEGFR2 SOP structure and other kinase structures occur at 
25 the kinase insert domain and the activation loop (discussed below). 

Catalytic loop and ATP binding site 

In protein kinases, the loop between E and 7 has been termed the catalytic 
30 loop as it contains an invariant aspartic acid (D1028) that is believed to function as a 
catalytic base in the phosphotransfer reaction (Johnson et al., 1996). This aspartic 
acid is part of a stretch of residues (H1026-N1033) whose sequence HRDLAARN is 
highly conserved among protein tyrosine kinases. In VEGFR2 50P the backbone 
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position and most sidechain positions of this loop are similar to those in the 
unliganded FGFR1 and ternary phosphorylated IRK (!RKP) complex structures. As 
seen in these previous structures the sidechain carboxylate of the catalytic loop 
aspartic acid (D1028) is hydrogen bonded to the sidechains of the conserved arginine 
5 (R1032) and asparagine (N1033) (Figure 3). 

The ATP binding site of protein kinases lies at the cleft between the N and C- 
terminal lobes (Figure 2c). For VEGFR2 50P, the residues forming this site consist 
primarily of residues E917-N923, joining the two lobes, and residues L840-I849 which 

10 include parts of 1 , 2, and the glycine-rich loop of G841-G846. The glycine-rich loop, 
also referred to as the nucleotide binding loop, is a flexible segment whose position 
differs among kinase structures in various activated and liganded states. In 
VEGFR2 SOP this loop is fairly well ordered and all atoms could be modeled with the 
exception of the sidechains of R842 and F845. The relative position and 

15 conformation of this loop is similar to that observed in the unligated FGFR1 structure. 
However, this position is markedly different from that in the IRKP ternary complex 
structure in which the approximately 20° relative rotation of the N and C-terminal 
lobes results in the glycine-rich loop being 5 A closer to the C-terminal lobe than in 
VEGFR2 SOP structure. 

20 

In reported kinase structures with bound ATP or an ATP analog, the adenine 
ring makes two conserved hydrogen bonds with the protein backbone. In the 
structure of FGFR1 with AMP-PCP bound (Mohammadi et al M 1996) these hydrogen 
bonds are between the adenine NH2 and the backbone C=0 of E562 (E917 
25 VEGFR2 SOP) and between the adenine N1 and the backbone NH of A546 (C919 
VEGFR2 50P). Although the structure presented here does not contain a bound 
nucleotide, the similarities in the positions of these backbone atoms to those in 
FGFR1 indicate that these hydrogen bonds would be formed in a VEGFR2 50P-ATP 
complex and therefore the adenine is expected to bind in a similar position (Figure 4). 

30 

Variation in the ATP-binding sites of kinases involved in disease is of 
considerable importance in the design of selective ATP-competitive inhibitors as 
therapeutics. A comparison of the ATP binding sites of FGFR1 and VEGFR2 50P 
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reveals that while the overall architecture of the site is conserved, several sequence 
differences result in differences in the shape of the accessible area for ligand binding. 
Specific sequence differences between FGFR1 and VEGFR-2 in this site include: 
V899 (1545 FGFR1), F918 (Y563 FGFR1), C919 (A564 FGFR1), and C1045 (A640 
5 FGFR1 ) (Figure 4). Similarly, comparison to the ternary IRKP complex structure 
reveals variation in the adenine site at V916 (M1076 IRK), F91 8(11 078), C919 
(M1079 IRK), L1035 (M1139 IRK), and C1045 (G1149 IRK). Even greater sequence 
and structural variation in the adenine site is seen when the VEGFR2 SOP structure is 
compared to serine/threonine kinase structures, suggesting that these differences are 
10 useful in the design of selective ATP-competitive inhibitors. 

Activation loop 

Protein kinases contain a large flexible loop, referred to as the activation loop 
15 (A-loop) whose conformation is postulated to regulate kinase activity (Figure 2). In 
many kinases the conformation of the AL is controlled by the phosphorylation of 
specific A-loop residues (Johnson et aL, 1996). The loop can be generally defined as 
beginning with the conserved residues DFG and ending at the conserved APE 
sequence (Johnson et al., 1996). In VEGFR-2 this segment corresponds to D1046- 
20 E1075 and contains two tyrosines (Y1054 and Y1059). Both Y1054 and Y1059 were 
found to be autophosphorylation sites when the cytosolic domain of VEGFR-2 was 
expressed in E. coli (Dougher-Vermazen et al., 1994). Using the in vitro 
autophosphorylation protocol described here for VEGFR2 50, a stable phosphorylation 
site is indicated at Y1059, however no evidence of phosphorylation of Y1054 was 
25 detected. 

In this unliganded VEGFR2 50P structure presented here, the A-loop appears 
quite mobile and interpretable electron density was not present for most of the central 
portion of the loop (G1048-G1063). This disorder is consistent with mobility of the A- 
30 loop deduced from other kinase structures. For example, of the two molecules in the 
asymmetric unit of the unphosphorylated FGFR1 kinase structure the center of the A- 
loop has relatively high temperature factors in molecule A and is completely 
disordered in molecule B. Although residues 1048-1063 could not be modeled in 
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VEGFR2 SOP, unambiguous electron density was present for residues D1064-E1075, 
dearly indicating that these residues adopt a conformation similar to that observed in 
the unphosphorylated FGFR1 structure. The segment of D1064-P1Q68 has an 
extended structure that lies adjacent to the catalytic residues D1028 and R1032 
5 (Figure 3a). Comparison to the structure of the (MgAMP-PNP)-peptide-IRKP 
complex structure indicates that the position of R1066-P1068 in this VEGFR2 SOP 
structure is inhibitory to substrate binding. P1066 occupies equivalent space 
allocated to the tyrosine sidechain of the peptide substrate in the ternary IRK3P 
complex structure. The conformation of residues L1069-E1075 is similar to that in the 
10 ternary IRKP complex structure, however there is a complete directional change at 
P1068 (P1 172 IRK) between the two structures. In the IRK structure residues N- 
terminal to this proline are directed toward EF while in VEGFR2 50P they are directed 
toward D on the opposite side of the protein (Figures 2 and 3). 

1 5 Despite the phosphorylation of Y1 059 prior to crystallization, the conformation 

seen here for residues D1064-P1068 is similar to the inhibitory conformation 
observed for analogous residues in the unphosphorylated FGFR1 structure. Y1059 
in VEGFR2 50 corresponds to a relatively conserved phosphorylation site among 
protein tyrosine kinases. In the ternary IRKP complex structure and the 

20 phosphorylated lymphocyte kinase (Lck) structure (Yamaguchi and Hendrickson, 
1996) the tyrosine at this position (Y1 163 IRK, Y394 Lck) is phosphorylated and the 
A-loop has a non-inhibitory conformation similar to that observed in a phosphorylated 
cAPK ternary complex structure (Zheng et al., 1993). The interactions the phosphate 
group at this position makes with other protein residues are believed to help stabilize 

25 an A-loop conformation that allows substrate and ATP binding (Johnson et aL, 1996; 
Hubbard, 1997). However, since this VEGFR2 50P structure described here does not 
exhibit a similar open A-loop conformation but rather has an inhibitory conformation 
with much of the loop disordered it is possible that the monophosphorylated A-loop of 
VEGFR2 SOP exists in a dynamic equilibrium involving several conformations and that 

30 the conformation observed here is the one most favored in this crystal environment. 
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Kinase Insert Domain: 

The kinase insert domain occurs in the kinase C-terminal lobe and connects 
helices D and E. In VEGFR-2 this region corresponds to a 68 residue stretch from 
5 N933 to L1000 (Figure 1). The lack of effect on intrinsic kinase activity (noted above) 
of deletion of residues T940-E989 is perhaps not surprising as the ends of the KID 
domain occur relatively far away (approximately 35-40 A) from the catalytic site and 
on the opposite side of the protein from the position of the activation loop (Figure 2). 
These results are consistent with those for the CSF-1 receptor kinase in which 

10 deletion of 58 of the 64 residues of the CSF-1 KID only decreased it's ability to 

phosphorylate a peptide substrate by 10% (Taylor et ah, 1989). Deletion of the entire 
98 residues of PDGFR, however, resulted in an 80% decrease in kinase activity 
towards a peptide substrate (Severinsson et al., (1990) Mol. Cell. Biol. 10, 801-809). 
Thus, the present invention allows for the production of a synthetic catalytic linker 

15 which recognizes that the majority of KID is not required for catalysis but rather only a 
small number of residues must be present to form a linker between aD and aE so as 
to maintain a competent kinase structure. 

In the VEGFR2 SOP structure following D, residues N933-P937 form a loose 
20 turn and an extended strand whose ends are roughly perpendicular to the axes of D 
and I at the C-terminus. In different Fourier maps, the electron density is strong and 
clear for residues N933-P937 and becomes weak for Y938 and K939 (sidechains of 
Y938 and K939 are not modeled) (Figure 5). The 50 residue deletion in VEGFR2 50 
directly follows K939 so that the residue immediately C-terminal to K939 is V990, 
25 maintaining the residue numbering in full-length VEGFR-2. Residues V990-K997 are 
disordered and interpretable electron density begins again at D998. Residues D998- 
T1001 then form a short strand that joins E at residue L1002 (Figures 5 and 6). 

The two strands at the N-terminal and C-terminal ends of the KID form a 
30 pseudo two-stranded parallel -sheet structure that is different from the conformations 
seen in this region of other kinase structures. The two ends of the KID thus make a 
variety of interactions which may help to stabilize the overall conformation and 
position of this domain in VEGFR-2. The sidechain of K93 1 makes an ionic 
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interaction with the sidechain of E934 and also makes a hydrogen bond to the 
backbone carbonyl of D998 (Figure 6). Hydrogen bonding interactions between the 
strands include: E934 backbone C=0 to L1000 NH, V936 NH to L1000 C=0, and 
P937 C=0 to L1002 NH. In addition to these polar interactions, the sidechains of 
5 F935, P937, and L1000 are involved in extensive hydrophobic contacts. The 
sidechain of F935 is nestled in a hydrophobic pocket formed by the sidechains of 
L928, P937, L1000, L1002, L1005, L1 101, and Y1130 (Figures 5 and 6). The L1000 
sidechain also packs against the sidechains of Y927, K931, H1004, and Y1008. 



10 It has been found by the applicants that deletion of portions of the KID also 

impart other useful and desirable characteristics to the modified VEGFR-2 
polyprotein. The modified polypeptide has exhibited greater stability when exposed to 
higher temperatures in solution than the wild-type protein. Additionally, the modified 
polypeptide has also exhibited improved solubility than the wild-type protein. It is 

1 5 apparent to those skilled in the art that these properties allow improvements in 

various commercial aspects of the present invention. Examples of potential uses for 
the modified proteins include high-throughput screening of potential ligands for the 
receptor by various methods including those based on gene-chip technology 
(Affymax, lnc f ) phage-display peptide libraries (The Ph.D. Kit® by New England 

20 BioLabs, Inc.) as well as in-depth analysis via FT-NMR. 



It is therefore contemplated that the entire KID can be deleted and retain 
some catalytic activity in other related RTKs such as but not limited to PDGF and 
and other previously mentioned proteins. Furthermore, in one embodiment of the 
25 invention the entire KID is deleted and replaced with a synthetic catalytic linker of at 
least one amino acids such that both the catalytic activity and the crystallizability of 
the protein is retained. 



Cloning of The PDGFR Protein 

In this example, the PDGFR polyprotein is cloned using the methods outlined 
for VEGFR-2 above. The coding sequence for PDGFR is derived from the sequence 
disclosed by MatsuiJ., et al., (1989) Science 243: 800-804 (Accession No. 66814). 
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PCR oligonucleotide primers are then made which code for residues located in the 
cytoplasmic domain and the catalytic domain of the protein. The catalytic domain of 
PDGFR is shown starting at residue 689 (N) and ending at residue 791 (T) in Figure 
1. 

The remainder of the cloning and purification steps would be similar to those 
disclosed for the VEGFR2 50 protein and use technology well known to those skilled 
in the art. 

It is contemplated that other members of the RTK family and other uses for 
the data disclosed herein and are not limited by the examples shown. 
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WHAT IS CLAIMED !S: 

1 . A method for assaying a candidate compound for its ability to interact with a 
modified receptor tyrosine kinase (RTK) polypeptide comprising: 

a) expressing an isolated DNA sequence or variants thereof encoding the 
modified RTK gene construct wherein said RTK gene contains a synthetic 
catalytic linker wherein said linker comprises at least one amino acid from the 
kinase insert domain (KID) of the VEGFR-2 gene catalytic region, in a host 
capable of producing a form of the polypeptide which form may be assayed for 
interaction of said polypeptide with said candidate substance; 

b) exposing said modified polypeptide to said candidate substance; and 

c) evaluating the interaction of said polypeptide with said candidate substance. 

2. The method of claim 1 , wherein said evaluation step further comprises: 

(a) crystallizing said modified polypeptide in a condition suitable for x-ray 
crystallography; and 

(b) conducting said x-ray crystallography on said polypeptide. 

3. A method for assaying a candidate compound for its ability to interact with a 
modified VEGFR-2 receptor polypeptide comprising: 

a) expressing an isolated DNA sequence or variants thereof encoding the 
modified VEGFR-2 gene construct wherein said VEGFR-2 gene contains a 
synthetic catalytic linker wherein said linker comprises at least one amino acid 
from the kinase insert domain (KID) of the VEGFR-2 gene catalytic region, in a 
host capable of producing a form of the polypeptide which form may be assayed 
for interaction of said polypeptide with said candidate substance; 

b) exposing said modified polypeptide to said candidate substance; and 
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c) evaluating the interaction of said modified polypeptide with said candidate 
substance. 

4. The method of claim 2, wherein said evaluation step further comprises: 

(a) crystallizing said modified polypeptide in a condition suitable for x-ray 
crystallography; and 

(b) conducting said x-ray crystallography on said polypeptide. 

5. An isolated DNA sequence or variants thereof encoding a modified RTK gene 
construct wherein said RTK gene contains a synthetic catalytic linker wherein 
said linker comprises at least one amino acid from the kinase insert domain of 
the RTK gene catalytic region. 

6. An isolated DNA sequence or variants thereof encoding a modified VEGFR-2 
gene construct wherein said VEGFR-2 gene contains a synthetic catalytic 
linker wherein said linker comprises at least one amino acid from the kinase 
insert domain of the VEGFR-2 gene catalytic region. 

7. The isolated oligonucleotide sequence of claim 6 comprising a DNA sequence 
or variants thereof in SEQ. ID NO. 5. 

8. The isolated oligonucleotide sequence of claim 6 comprising a DNA sequence 
or variants thereof in SEQ. ID NO. 6. 

9. A method of assessing compounds which are agonists or antagonists of the 
activity of the a modified RTK gene polypeptide wherein said modified RTK 
gene contains a synthetic catalytic linker wherein said linker contains at least 
one amino acid from the kinase insert domain of the RTK polypeptide catalytic 
region comprising: 

a) crystallizing said modified RTK polypeptide; 
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b) obtaining crystallography coordinates for said crystallized modified RTK 
polypeptide; 

c) applying said crystallography coordinates for said modified RTK polypeptide 
to a computer algorithm such that said algorithm will generate a model of said 
RTK polypeptide suitable for use in designing molecules that will act as 
agonists or antagonists to said polypeptide; and 

d) applying and iterative process whereby various molecular structures are 
applied to said computer generated model to identify potential agonists or 
antagonists to said polypeptide. 

10. A method of assessing compounds which are agonists or antagonists of the 
activity of the a modified VEGFR-2 gene polypeptide wherein said modified 
VEGFR-2 gene contains a synthetic catalytic linker wherein said linker 
comprises at least one amino acid from the kinase insert domain of the 
VEGFR-2 polypeptide catalytic region comprising: 

a) crystallizing said modified VEGFR-2 polypeptide; 

b) obtaining crystallography coordinates for said crystallized modified VEGFR- 
2 polypeptide; 

c) applying said crystallography coordinates for said modified VEGFR-2 
polypeptide to a computer algorithm such that said algorithm will generate a 
model of said VEGFR-2 polypeptide suitable for use in designing molecules 
that will act as agonists or antagonists to said polypeptide; and 

d) applying and iterative process whereby various molecular structures are 
applied to said computer generated model to identify potential agonists or 
antagonists to said polypeptide. 
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1 1 . The method of claim 10, wherein said modified VEGFR-2 polypeptide 
comprises the VEGFR2 50 polypeptide of Seq. ID No. 5. 

12. An isolated DNA sequence comprising a DNA sequence or variants thereof 
encoding the VEGFR-2 gene construct having the x-ray coordinates of Figure 
5. 

13. A method for preparing proteins or polypeptides of the receptor tyrosine 
kinase family such that they are suitable for measurement by x-ray 
crystallography comprising: 

a) identification of the Kinase Insert Domain within the catalytic domain of said 
proteins; 

b) deletion of a specific number of amino acid residues from said Kinase 
Insert Domain such that the modified polypeptide now has a stable 
conformation such that it may form a crystalline state suitable for being 
measured by x-ray crystallography; and 

c) crystallizing said modified polypeptide. 

14. A process of drug design for compounds which interact with RTK polypeptides 
comprising: 

a) deletion of a portion of the KID of the target RTK polypeptide; 

b) crystallizing said target RTK polypeptide; 

c) resolving the x-ray crystallography of said target RTK polypeptide; 

d) applying the data generated from resolving the x-ray crystallography of said 
target RTK polypeptide to a computer algorithm which will generate a model of 
said target RTK polypeptide suitable for use in designing molecules that will 
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act as agonists or antagonists to said polypeptide; and 

e) applying an iterative process whereby various molecular structures are 
applied to said computer generated model to identify potential agonists or 
antagonists to said target RTK polypeptide. 

15. A process of drug design for compounds which interact with modified VEGFR- 
2 polypeptides comprising: 

a) deletion of a portion of the KID of the modified VEGFR-2 polypeptide; 

b) crystallizing said modified VEGFR-2 polypeptide; 

c) resolving the x-ray crystallography of said modified VEGFR-2 polypeptide; 

d) applying the data generated from resolving the x-ray crystallography of said 
modified VEGFR-2 polypeptide to a computer algorithm which will generate a 
model of said modified VEGFR-2 polypeptide suitable for use in designing 
molecules that will act as agonists or antagonists to said polypeptide; and 

e) applying an iterative process whereby various molecular structures are 
applied to said computer generated model to identify potential agonists or 
antagonists to said modified VEGFR-2 polypeptide. 

16. The method of claim 15, wherein said modified VEGFR-2 polypeptide 
comprises the VEGFR2 50 polypeptide of Seq. ID No. 5. 
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Figure 1 



nudeotide-binding 

oB pi ^i^P2 

VEGF-R2 806 MDPDELPLDEHCERLPYDASKWEFPRDRLKLGKPLGRGAFGQVIEADAFGIDKTATCR-- 863 

FGFR1 456 — MLAGVSEY--ELP-EDPRWELPRDRLVLGKPLGEGCFGQVVLAEAIGLDKDKPNRVT 509 

IRK 978 VFPCSVWPDEWEVSREKITLLRELGQGSFGMVYEGNARDIIKGEAE— T 1025 

VEGF-R1 799 MDPDEVPLDEQCERLPYDASKWEFARERLKLGKSLGRGAFGKVVQASAFGIKKSPTCR-- 856 

PDGFRa 576 DPMQLPYD-SRWEFPRDGLVLGRVLGSGAFGKVVEGTAYGLSRSQPVM-- 622 

63 aC S4 B5 

mma^^m wmmmmmtmmmmmmmmm m^m^mm m^mmm 

VEGF-R2 864 TVAVKMLKEGATHSEHRALMSELKILIHIGHHLNVVNLLGACTKPGGPLMVIVEFCKFGN 923 
FGFR1 510 KVAVKMLKSDATEKDLSDLISEMEMMKMIGKHKNIINLLGACT-QOGPLYVIVEYASKGN 568 
IRK 1026 RVAVKTVNESASLRERIEFLNEASVMKGFTCH-HVVRLLGVVSK-GQPTLVVMELMAHGD 1083 
VEGF-R1 857 TVAVKMLKEGATASEYKALMTEIKILTHIGHHLNVVNLLGACTKQGGPLMVIVEYCKYGN 916 
PDGFRa 623 KVAVKMLKPTARSSEKQALMSELKIMTHLGPHLNIVNLLGACTK-SGPIYIITEYCFYGD 681 
OtO kinase insen domain 

VEGF-R2 924 ISTYLRSKRNEFVPYKTKGARFRQGKDYVG 953 

FGFR1 569 LREYLQARRPPGLEYCYN - - — 586 

IRK 1084 LKSYLRSLRPEA 1095 

VEGF-Ri 917 LSNYLKSKRDLFFLNKDAALHMEPKKEKME - 946 

PDGFRa 682 LVNYLHKNRD5FLSKHPEKPKKELDIFGLNPADESTRSYVILSFENNGDYMDMKQADTTQ 741 

V; aE • 

VEGF-R2 954 — AIPVDIKRRLDSITSSQSSASSGFVEEKSISDVEEEEAPEDLYKDFLTLEHLICYSF 1010 

FGFR1 587 PSHNPEEQLSSKDLVSCAY 605 

IRK 1096 --- ENNPGRPPPTLQEMIQMAA 1114 

VEGF-RI 947 --PGLEQGKKPRLDSVTSSESFASSGFQEDKSLSDVEEEEDSDGFYKEPITMEDLISYSF 1004 

PDGFRa 742 YVPMLERKEVSKYSDIQRSLYDRPASYKK-KSMLDSEVKNLLSDDNSEGLTLLDLLSFTY 800 

^catalytic loop^ pg activation loop '-■ - 

VEGF-R2 1011 QVAKGMEFLASRKCIHRDLAARNILLSEKNVVKICDFGLARDIYKDPDYVRKGDARLPLK 1070 

FGFR1 606 QVARGMEYLASKKCIHRDLAARNVLVTEDNVMKIADFGLARDIHHIDYYKKTTNGRLPVK 665 

IRK 1115 EIADGMAYLNAKKFVHRDLAARNCMVAHDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVR 1174 

•VEGF-RI 1005 QVARGMEFLSSRKCIHRDLAARNILLSENNVVKICDFGLARDIYKNPDYVRKGDTRLPLK 1064 

PDGFRa 801 QVARGMEFLASKNCVHRDLAARNVLLAQGKIVKICDFGLARDIMHDSNYVSKGSTFLPVK 860 

,aEF aF aG 

VEGF-R2 1071 WMAPETIFDRVYTIQSDVWSFGVLLWEIFSLGASPYPGVKIDEEFCRRLKEGTRMRAPDY 1130 

FGFR1 666 WMAPEALFDRIYTHQSDVWSFGVLLWEIFTLGGSPYPGVPVEEIF-KLLKEGHRMDKPSN 724 

IRK 1175 WMAPESLKDGVFTTSSDMWSFGVVLWEITSIAEQPYQGLSNEQVL-KFVMDGGYLDQPDN 1233 

VEGF-RI 1065 WMAPESIFDKIYSTKSDVWSYGVLLWEIFSLGGSPYPGVQMDEDFCSRLREGMRMRAPEY 1124 

PDGFRa 861 WMAPESIFDNLYTTLSDVWSYGILLWEIFSIGGTPYPGMMVDSTFYNKIKSGYRMAKPDH 920 
ocH al 

VEGF-R2 1131 TTPEMYQTMLDCWHGEPSQRPTFSELVEHLGNLIQANAQQD 1171 

FGFR1 725 CTNELYMMMRDCWHAVPSQRPTFKQLVEDLDRIVALTSNQE 765 

IRK 1234 CPERVTDLMRMCWQFNPNMRPTFLEIVNLLKDDLHPSFPEV 1274 

VEGF-RI 1125 STPEIYQIMLDCWHRDPKERPRFAELVEKLGDLLQANVQQD 1165 

PDGFRa 921 ATSEVYEIMVKCWNSEPEKRPSFYHLSEIVENLLPGQYKKS 961 
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<1I0> McTigue Ph.D., Kichele A. 

<120> Modifications of The VEGF Receptor-2 Protein And 
Methods Of Use 

<130> Modifed VEGFR-2 protein 

<140> 0125-0016 
<141> 1998-09-08 

<160> 1 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 367 
<212> PRT 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Human Aorta cDNA 
library 

<400> 1 

Met Asp Pro Asp Glu Leu Pro Leu Asp Glu His Cys Glu Arg Leu Pro 
15 10 15 

Tyr Asp Ala Ser Lys Trp Glu Phe Pro Arg Asp Arg Leu Lys Leu Gly 
20 25 30 

Lys Pro Leu Gly Arg Gly Ala Phe Gly Gin Val He Glu Ala Asp Ala 
35 40 45 

Phe Gly He Asp Lys Thr Ala Thr Cys Arg Thr Val Ala Val Lys Met 
50 55 60 

Leu Lys Glu Gly Ala Thr His Ser Glu His Arg Ala Leu Met Ser Glu 
65 70 75 80 

Leu Lys He Leu He His He Gly His His Leu Asn Val Val Asn Leu 
85 90 95 

Leu Gly Ala Cys Thr Lys Pro Gly Gly Pro Leu Met Val He Val Glu 
100 105 HO 

Phe Cys Lys Phe Gly Asn Leu Ser Thr Tyr Leu Arg Ser Lys Arg Asn 
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115 120 125 

Giu Phe Vai Pro Tyr Lys Thr Lys Gly Ala Arg Phe Arg Gin Gly Lys 
130 135 140 

Asp Tyr Val Gly Ala He Pro Val Asp Leu Lys Arg Arg Leu Asp Ser 
145 150 155 160 

He Thr Ser Ser Gin Ser Ser Ala Ser Ser Gly Phe Val Glu Glu Lys 
165 170 175 

Ser Leu Ser Asp Val Glu Glu Glu Glu Ala Pro Glu Asp Leu Tyr Lys 
180 185 190 

Asp Phe Leu Thr Leu Glu His Leu Leu He Cys Tyr Ser Phe Gin Val 
195 200 205 

Ala Lys Gly Met Glu Phe Leu Ala Ser Arg Lys Cys He His Arg Asp 
210 215 220 

Leu Ala Ala Arg Asn He Leu Leu Ser Glu Lys Asn Val Val Lys He 
225 230 235 240 

Cys Asp Phe Gly Leu Ala Arg Asp He Tyr Lys Asp Pro Asp Tyr Val 
245 250 255 

Arg Lys Gly Asp Ala Arg Leu Pro Leu Lys Trp Met Ala Pro Glu Thr 
260 265 270 

He Phe Asp Arg Val Tyr Thr He Gin Ser Asp Val Trp Ser Phe Gly 
275 280 285 

Val Leu Leu Trp Glu He Phe Ser Leu Gly Ala Ser Pro Tyr Pro Gly 
290 295 300 

Val Lys He Asp Glu Glu Phe Cys Arg Arg Leu Lys Glu Gly Thr Arg 
305 310 315 320 

Met Arg Ala Pro Asp Tyr Thr Thr Pro Glu Met Tyr Gin Thr Met Leu 
325 330 335 

Asp Cys Trp His Gly Glu Pro Ser Gin Arg Pro Thr Phe Ser Glu Leu 
340 345 350 

Val Glu His Leu Gly Asn Leu Leu Gin Ala Asn Ala Gin Gin Asp 
355 360 365 
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This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



I I x] As all required additional search fees were timely paid by the applicant, this international search report covers all searchable 

claims. 

2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3. Q J As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. [ | No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest j | The additional search fees were accompanied by the applicant's protest. 

j | X| No protest accompanied the payment of additional search fees. 
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A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (7): 

C07K LOO, 14/00, 17/00; C12N 9/00. 9/12; CUP 21/06; C30B 7/00, 21/02, 28/06; G01N 31/00, 33/00, 33/53, 33/567 

BOX I. OBSERVATIONS WHERE CLAIMS WERE FOUND UNSEARCHABLE 
2. Where no meaningful search could be carried out, specifically: 

Claims 7-8, 1 1 and 16 are unsearchable to the extent that they require reference to the specified sequences from the 
sequence listing. Because Applicant has not included these sequences on the submitted machine-readable copy of the 
sequence listing, no meaningful search of the sequences per se can be carried out by this Authority. However, the 
subject matter of the claims has been searched to the extent possible with reference to the balance of the description. 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

Group I. claim(s)l-l2. drawn to an isolated polynucleotide, a polypeptide, and a method of assaying compound 
interaction with the polypeptide. 

Group II, claim 13, drawn to method of preparing proteins for crystallization. 
Group III, claim(s) 14-16, drawn to a process for drug design. 

The inventions listed as Groups l-HI do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 

The products/processes of Groups Mil are separate and patentably distinct inventions in that the polynucleotide, 
polypeptide and method of compound assay of Group I. the method of preparing proteins for crystallization of Group II, 
and the process of drug design in Group 111 do not require each other for their practice and have separate functions alt of 
which constitute the special technical features which define the contribution of each invention. Since these special 
technical features are not shared by each product, the inventions of Groups l-III do not form a single inventive concept 
within the meaning of Rule 13.2. 
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